Vbert #339
Conversation
e77142f to 5c11cd3
* modeling
* update modeling
* update token id default
* init files
* remove vllama + update torch lower bound for cpu
* back to normal transformer bound
* clean
* Update colpali_engine/models/__init__.py

Co-authored-by: QuentinJGMace <[email protected]>
Mostly comments about the form, overall LGTM!
-ColQwen2_5Omni,
-ColQwen2_5OmniProcessor,
+# ColQwen2_5Omni,
+# ColQwen2_5OmniProcessor,
Add a comment to the README if ColQwen 2.5 Omni is no longer supported.
 # Process queries.
-queries = [self.processor.query_prefix + q + self.processor.query_augmentation_token * 10 for q in queries]
+# queries = [self.processor.query_prefix + q + self.processor.query_augmentation_token * 10 for q in queries]
Remove the commented-out lines if they are not useful.
Actually useful: in ModernVBERT, self.processor.query_prefix is "", but the commented line is useful if somebody wants to reproduce older models.
Thanks for flagging it!
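For context, a standalone sketch of what re-enabling that prefix would look like; the mock processor and its attribute values below are illustrative assumptions, not code from this PR:

```python
# Illustrative mock: query_prefix is "" for ModernVBERT but non-empty for some older models.
class MockProcessor:
    query_prefix = "Query: "            # assumed value for an older model; "" for ModernVBERT
    query_augmentation_token = "<pad>"  # placeholder augmentation token

processor = MockProcessor()
queries = ["what does figure 3 show?"]
# The commented-out line in the diff corresponds to this prefixed form:
queries = [processor.query_prefix + q + processor.query_augmentation_token * 10 for q in queries]
print(queries[0])
```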
 # Process queries.
-queries = [self.processor.query_prefix + q + self.processor.query_augmentation_token * 10 for q in queries]
+# queries = [self.processor.query_prefix + q + self.processor.query_augmentation_token * 10 for q in queries]
+queries = [q + self.processor.query_augmentation_token * 10 for q in queries] if is_str else queries
put 10 into a constant (e.g. N_AUGMENTATION_TOKENS)
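A minimal sketch of that suggestion; the helper function below is hypothetical and only illustrates naming the magic number:

```python
# Sketch: replace the bare 10 with a named constant, as suggested above.
N_AUGMENTATION_TOKENS = 10  # number of query augmentation tokens appended to each query

def augment_queries(queries, augmentation_token="<pad>"):
    """Append N_AUGMENTATION_TOKENS copies of the augmentation token to each query."""
    return [q + augmentation_token * N_AUGMENTATION_TOKENS for q in queries]

print(augment_queries(["what does figure 3 show?"]))
```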
else:
    proc_batch[k] = v
unnecessary
-query_outputs = model(input_ids=inputs["query_input_ids"], attention_mask=inputs["query_attention_mask"])
+query_outputs = model(**{k[6:]: v for k, v in inputs.items() if k.startswith("query")})
+# feed only kwargs with 'doc_' prefix
+doc_outputs = model(**{k[4:]: v for k, v in inputs.items() if k.startswith("doc")})
Define a var/constant for len("doc_").
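A possible shape for that refactor; the constant and helper names are my own suggestions, not code from the PR:

```python
# Sketch: strip the "query_"/"doc_" key prefixes with named constants instead of magic slice indices.
QUERY_PREFIX = "query_"
DOC_PREFIX = "doc_"

def strip_prefixed_kwargs(inputs, prefix):
    """Return the entries of `inputs` whose keys start with `prefix`, with the prefix stripped."""
    return {k[len(prefix):]: v for k, v in inputs.items() if k.startswith(prefix)}

inputs = {"query_input_ids": [[1, 2]], "query_attention_mask": [[1, 1]], "doc_input_ids": [[3, 4]]}
print(strip_prefixed_kwargs(inputs, QUERY_PREFIX))  # {'input_ids': [[1, 2]], 'attention_mask': [[1, 1]]}
print(strip_prefixed_kwargs(inputs, DOC_PREFIX))    # {'input_ids': [[3, 4]]}
```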
| """ | ||
| Helper function to reshape negative doc inputs to (batch_size * num_neg_docs, ...) | ||
| """ | ||
| neg_doc_inputs = {k[8:]: v for k, v in inputs.items() if k.startswith("neg_doc")} |
Define a var/constant for 8 (i.e. len("neg_doc_")).
Could rename variables for more clarity, use constants, and add a docstring.
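Putting these suggestions together for the negative-doc helper; the names and the reshape step below are assumptions based on the snippet and docstring above, not the PR's final code:

```python
import torch

NEG_DOC_PREFIX = "neg_doc_"  # replaces the magic slice index 8 (== len("neg_doc_"))

def reshape_neg_doc_inputs(inputs):
    """Extract the "neg_doc_*" kwargs and flatten them to (batch_size * num_neg_docs, ...)."""
    neg_doc_inputs = {
        k[len(NEG_DOC_PREFIX):]: v for k, v in inputs.items() if k.startswith(NEG_DOC_PREFIX)
    }
    return {k: v.reshape(-1, *v.shape[2:]) for k, v in neg_doc_inputs.items()}

batch = {"neg_doc_input_ids": torch.zeros(2, 3, 5, dtype=torch.long)}
print(reshape_neg_doc_inputs(batch)["input_ids"].shape)  # torch.Size([6, 5])
```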
save as test_bi_losses
assert scores.shape == (len(ds), len(ds)), f"Expected shape {(len(ds), len(ds))}, got {scores.shape}"

# # Check if the maximum scores per row are in the diagonal of the matrix score
# assert (scores.argmax(dim=1) == torch.arange(len(ds), device=scores.device)).all()
why is this commented out?
Still to do:
Modify all the negatives losses, not just the ones used for vbert.